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This paper reports on an all-sky search for periodic gravitational waves from sources such as de- 
formed isolated rapidly-spinning neutron stars. The analysis uses 840 hours of data from 66 days of 
the fifth LIGO science run (S5). The data was searched for quasi- monochromatic waves with fre- 
quencies / in the range from 50 Hz to 1500 Hz, with a linear frequency drift / (measured at the solar 
system barycenter) in the range —f/r < f < 0.1 //r, for a minimum spin-down age r of 1 000 years 
for signals below 400 Hz and 8 000 years above 400 Hz. The main computational work of the search 
was distributed over approximately 100 000 computers volunteered by the general public. This large 
computing power allowed the use of a relatively long coherent integration time of 30 hours while 
searching a large parameter space. This search extends Einstein@Home's previous search in LIGO 
S4 data to about three times better sensitivity. No statistically significant signals were found. In 
the 125 Hz to 225 Hz band, more than 90% of sources with dimensionless gravitational-wave strain 
tensor amplitude greater than 3 x 10~^* would have been detected. 

PACS numbers: 04.80.Nn, 95.55.Ym, 97.60.Gb, 07.05.Kf 



I. INTRODUCTION 

Gravitational waves (GW) are predicted by Einstein's 
general theory of relativity, but have so far eluded direct 
detection. The Laser Interferometer Gravitational- wave 
Observatory (LIGO) [Ull] has been built for this purpose 
and is currently the most sensitive gravitational-wave de- 
tector in operation. 

Rapidly rotating neutron stars are expected to gener- 
ate periodic gravitational-wave signals through various 
mechanisms [31 IH IHl El IZl HI IS] • Irrespective of the emis- 
sion mechanism, these signals are quasi-monochromatic 
with a slowly changing intrinsic frequency. Additionally, 
at a terrestrial detector, such as LIGO, the data analysis 
problem is complicated by the fact that the periodic GW 
signals are Doppler modulated by the detector's motion 
relative to the solar system barycenter (SSB). 

A previous paper |10| reported on the results of the 
Einstein@Home search for periodic GW signals in the 
data from LIGO's fourth science run (S4). The present 
work extends this search, using more sensitive data from 
66 days of LIGO's fifth science run (S5). 

Because of the weakness of the GW signals buried in 



the detector noise, the data analysis strategy is criti- 
cal. A powerful detection method is given by coherent 
matched-filtering. This means one convolves all avail- 
able data with a set of template waveforms corresponding 
to all possible putative sources. The resulting detection 
statistic is derived in Ref. |llj and is commonly referred 
to as the J-"-statistic. 

The parameter space to be scanned for putative sig- 
nals from isolated neutron stars is four-dimensional, with 
two parameters required to describe the source sky posi- 
tion using standard astronomical equatorial coordinates 
a (right ascension) and 6 (declination), and additional 
coordinates (/, /) denoting the intrinsic frequency and 
frequency drift. To achieve the maximum possible sen- 
sitivity, the template waveforms must match the source 
waveforms to within a fraction of a cycle over the en- 
tire observation time (months or years for current data 
samples). So one must choose a very closely spaced grid 
of templates in this four-dimensional parameter space. 
This makes the computational cost of the search very 
high, and therefore limits the search sensitivity [T^ . 

To maximize the possible integration time, and hence 
achieve a more sensitive search, the computation was 
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distributed via the volunteer computing project Ein- 
stein@Home |13j . This large computing power allowed 
the use of a relatively long coherent integration time of 
30 h, despite the large parameter space searched. Thus, 
this search involves coherent matched-filtering in the 
form of the JF-statistic over 30-hour-long data segments 
and subsequent incoherent combination of J^-statistic re- 
sults via a coincidence strategy. 

The methods used here are further described in Sec- 
tions [ll|lV[ Estimates of the sensitivity of this search 
and results are in Sections |V] and |VI[ respectively. Pre- 
viously, other all-sky searches for periodic GW sources 
using LIGO S4 and S5 data, which combine power from 
many short coherent segments (30-minute intervals) of 
data, have been reported by the LIGO Scientific Collab- 
oration (LSC) [m US]. However, this Einstein@Home 
search explores large regions of parameter space which 
have not been analyzed previously with LIGO S5 data. 
The sensitivity of the results here are compared with pre- 



TABLE I: Segments of early S5 data used in this search. The 
columns are the data segment index j, the GPS start time tj 
and the time spanned Tspanj . 



vious searches in Section VII and conclusions are given 
in Section IVml 



II. DATA SELECTION AND PREPARATION 

The data analyzed in the present work was collected 
between November 19, 2005 and January 24, 2006. 
The total data set covering frequencies from 50 Hz to 
1500 Hz consisted of 660 h of data from the LIGO Han- 
ford 4-km (HI) detector and 180 h of data from the LIGO 
Livingston 4-km (LI) detector. 

The data preparation method is essentially identical 
to that of the previous S4 analysis [10] . Therefore only a 
brief summary of the main aspects is given here; further 
details are found in [lO' and references therein. The data 
set has been divided into segments of 30 h each. However, 
the 30-hour long data segments are not contiguous, but 
have time gaps. Since the number of templates required 
increases rapidly with observation span, the 30 h of data 
for each segment were chosen to lie within a time span of 
less than 40 h. In what follows, the notion of "segment" 
will always refer to one of these time stretches, each of 
which contains exactly T = 30 h of data. The total time 
spanned by a given data segment j is denoted by Tgpanj 
and conforms to 30h < Tgpanj < 40 h. 

Given the above constraints, a total of A'scg — 28 data 
segments (22 from HI, 6 from LI) were obtained from 
the early S5 data considered. These data segments are 
labeled hy j = 1, . . . , 28. Table|T]lists the global position- 
ing system (GPS) start time along with the time span of 
each segment. 

In this analysis, the maximum frequency shift of a 
signal over the length of any given data segment and 
parameter-space range examined is dominated by the 
Doppler modulation due to the Earth's orbital motion 
around the solar system barycenter (SSB), while the ef- 
fects of frequency change resulting from intrinsic spin- 
down of the source are smaller. The orbital velocity of 
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the Earth is about u/c « 10^^, hence a signal will always 
remain in a narrow frequency band smaller than ±0.15 Hz 
around a given source frequency. Therefore, for each de- 
tector the total frequency range from 50 Hz to 1500 Hz is 
broken up into 2900 slices, each of 0.5 Hz bandwidth plus 
overlapping wings of 0.175 Hz on either side. 

The detector data contains numerous narrow-band 
noise artifacts, so-called "lines", which are of instrumen- 
tal origin, such as harmonics of the 60 Hz mains fre- 
quency. Prior to the analysis, line features of understood 
origin (at the time before the launch of the search) were 
removed ("cleaned") from the data by substitution of 
the frequency-domain data bins with random Gaussian 
noise. Table [Hi] in the Appendix shows the frequencies of 
lines excluded from the data. The harmonic mean noise 
strain amplitude spectra of the final cleaned HI and LI 
data sets are shown in Fig. [l] 



III. DATA PROCESSING 

The paper describing the previous Einstein@Home 
search in S4 data [TD| presented in detail the data pro- 
cessing scheme. For the purpose of the present search the 
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FIG. 1: Strain amplitude spectral densities ^ Sh(f) of the 
cleaned data from the LIGO detectors HI (top) and LI (bot- 
tom) used in the EinsteinQHome searches. The curves in the 
top (bottom) panel are the harmonic mean of the 22 HI (6 
LI) 30-hour segments of S5 data used this Einstein@Home 
analysis. 

same data processing infrastructure is employed. Hence, 
here only a short summary thereof is given, pointing out 
the minimal changes applied in setting up the present 
analysis. 

The total computation of the search is broken up 
into 16 446 454 workunits. Each workunit represents 
a separate computing task and is processed using the 
Berkeley Open Infrastructure for Network Computing 
(BOINC) [nUmilH]- To ehminate errors and weed out 
results that are wrong, each workunit is independently 
processed by at least two different volunteers. Once two 
successful results for a workunit are returned back to 
the Einstein@Home server, they are compared by an au- 
tomatic validator, which discards results that differ by 
more than some allowed tolerance. New workunits are 
generated and run independently again for such cases. 

In searching for periodic gravitational-wave signals, 
each workunit examines a different part of parameter 
space. A key design goal is that the computational ef- 
fort to conduct the entire analysis should take about 6-7 
months. An additional design goal is to minimize the 
download burden on the Einstein@Home volunteers' in- 
ternet connections and also on the Einstein@Home data 
servers. This is accomplished by letting each workunit 
use only a small re-usable subset of the total data set, 
so that Einstein@Home volunteers are able to carry out 
useful computations on a one-day time scale. 

Each workunit searches only one data segment over a 



narrow frequency range, but covering all of the sky and 
the entire range of frequency derivatives. The workunits 
are labeled by three indices {j, k, t), where j — 1, . . . , 28 
denotes the data segment, k — 1, . . . , 2900 labels the 
0.5 Hz frequency band and M(j, k) enumerates 

the individual workunits pertinent to data segment j and 
frequency band k. 

In each segment the J^-statistic is evaluated on a grid 
in parameter space. Each parameter-space grid is con- 
structed such that grid points (templates) are not further 
apart from their nearest neighbor by more than a certain 
distance. The distance measure is defined from a metric 
on parameter space, first introduced in |19l 120] . repre- 
senting the fractional loss of squared signal-to-noise ratio 
(SNR^) due to waveform mismatch between the putative 
signal and the template. For any given workunit, the 
parameter-space grid is a Cartesian product of uniformly- 
spaced steps df in frequency, uniformly-spaced steps df 
in frequency derivative, and a two-dimensional sky grid, 
which has non-uniform spacings determined by the met- 
ric [iniiii]. 

For frequencies in the range [50, 400) Hz, the maximal 
allowed mismatch was chosen as to = 0.15 (correspond- 
ing to a maximal loss in SNR^ of 15%), while in the range 
[400, 1500) Hz, the maximal mismatch was to — 0.4. It 
can be shown [lOl |2Tj , that these choices of maximal mis- 
match enable a coherent search of near-optimal sensitiv- 
ity at fixed computational resources. 

The step-size in frequency / obtained from the 
metric depends on Tspanj" of the j'th data seg- 
ment: dfj = 2\/3m/{7rTsps.n.j)- In the low-frequency 
range this results in frequency spacings in the range 
dfj G [2.97, 3.67] /iHz, while for high-frequency workunits 
dfj e [4.85, 6.0] ^Hz. 

The range of frequency derivatives / searched is de- 
fined in terms of the "spin-down age" r = — ///, namely 
T > 1000 years for low-frequency and r > 8 000 years for 
high-frequency workunits. As in the S4 Einstein@Home 
search, these ranges were guided by the assumption that 
a nearby very young neutron star would correspond to 
a historical supernova, supernova remnant, known pul- 
sar, or pulsar wind nebula. The search also covers a 
small "spin-up" range, so the actual ranges searched are 
/ G [— //t, 0.1//r]. In / the grid points are spaced ac- 
cording to dfj — 12\/5m/{TTT^p^^ j), resulting in resolu- 
tions dfj e [1.60, 2.44] X 10~-^°Hz/s for low-frequency 
workunits, and dfj e [2.61, 3.99] x lO^^^Hz/s for high- 
frequency workunits. 

The resolution of the search grid in the sky depends 
both on the start time tj and duration Tspanj of the 
segment, as well as on the frequency /. The number of 
grid points on the sky scales as oc and approximately 
as oc T^ptn.j for the range of Tspanj ~ 30 — 40h used in 
this search. As was done in the previous S4 analysis [TO] . 
to simplify the construction of workunits and limit the 
number of different input files to be sent, the sky grids 
are fixed over a frequency range of 10 Hz, but differ for 
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each data segment j. The sky grids are computed at the 
higher end of each 10-Hz band, so they are shghtly "over- 
covering" the sky at lower frequencies within the band. 
The search covers in total a frequency band of 1450 Hz, 
so there are 145 different sky grids for each segment. 

The output from one workunit in the low (high) fre- 
quency range contains the top 1000 (10 000) candidate 
events with the largest values of the JF-statistic. In order 
to balance the load on the Einstein@Home servers, a low- 
frequency workunit returns a factor of 10 fewer events, 
because low-frequency workunits require runtimes ap- 
proximately 10 times shorter than high-frequency worku- 
nits. For each candidate event five values are reported: 
frequency (Hz), right ascension angle (radians), declina- 
tion angle (radians), frequency derivative (Hz/s) and 2J-' 
(dimensionless). The frequency is the frequency at the 
SSB at the instant of the first data point in the cor- 
responding data segment. Returning only the "loud- 
est" candidate events effectively corresponds to a float- 
ing threshold on the value of the .F-statistic. This avoids 
large lists of candidate events being produced in regions 
of parameter space containing non-Gaussian noise, such 
as instrumental artifacts that were not removed a priori 
from the input data because of unknown origin. 

IV. POST-PROCESSING 

After results for each workunit are returned to the 
Einstein@Home servers by project volunteers, post- 
processing is conducted on those servers and on dedicated 
computing clusters. The post-processing has the goal of 
finding candidate events that appear in many of the 28 
different data segments with consistent parameters. 

In this search, the post-processing methods are the 
same as used for the Einstein@Home S4 search [10]. 
Therefore, this section only summarizes the main steps; 
a more detailed description can be found in [101 . 

A consistent (coincident) set of "candidate events" is 
called a "candidate". Candidate events from different 
data segments are considered coincident if they clus- 
ter closely together in the four-dimensional parameter 
space. By using a grid of "coincidence cells", the clus- 
tering method can reliably detect strong signals, which 
would produce candidate events with closely-matched pa- 
rameters in many of the 28 data segments. The post- 
processing pipeline operates in 0.5 Hz- wide frequency 
bands, and performs the following steps described below. 

A. The post-processing steps 

A putative source with non-zero spin-down would gen- 
erate candidate events with different apparent frequency 
values in each data segment. To account for these effects, 
the frequencies of the candidate events are shifted back 
to the same frequency value at fiducial time ifiduciai via 
/(^fiducial) = f{tj) + (^fiducial " tj) /, whcrc / and f{tj) 



are the spin-down rate and frequency of a candidate 
event reported by the search code in the result file, 
and tj is the time-stamp of the first datum in the 
j'th data segment. The fiducial time is chosen to be 
the GPS start time of the earliest (j = 1) data segment, 
^fiducial = ti = 816 397 490 s. 

A grid of cells is then constructed in the four- 
dimensional parameter space to find coincidences among 
the 28 different data segments. The coincidence search 
algorithm uses rectangular cells in the coordinates 
{f,f,acos6,6). The dimensions of the cells are adapted 
to the parameter-space search grid (see below). Each 
candidate event is assigned to a particular cell. In cases 
where two or more candidate events from the same data 
segment j fall into the same cell, only the candidate event 
having the largest value of 2JF is retained in the cell. 
Then the number of candidate events per cell coming 
from distinct data segments is counted, to identify cells 
with more coincidences than would be expected by ran- 
dom chance. 

To ensure that candidate events located on opposite 
sides of a cell border are not missed, the entire cell coin- 
cidence grid is shifted by half a cell width in all possible 
2* = 16 combinations of the four parameter-space dimen- 
sions. Hence, 16 different coincidence cell grids are used 
in the analysis. 

B. Construction of coincidence windows 

The coincidence cells are constructed to be as small as 
possible to reduce the probability of false alarms. How- 
ever, since each of the 28 different data segments uses a 
different parameter space grid, the coincidence cells must 
be chosen to be large enough that the candidate events 
from a source (which would appear at slightly different 
points in parameter space in each of the 28 data seg- 
ments) would still lie in the same coincidence cell. 

In the frequency direction, the size A/ for the co- 
incidence cell is given by the largest search grid spac- 
ing in / (for smallest value of Tgpanj) plus the largest 
possible offset in spin-down: A/ = maxj (dfj + At dfj), 
where the maximization over j selects the data seg- 
ment with the smallest Tsp^n.j (which is j = 6) and 
At = I maxj tj — miuj tj \ = t22 — ti = 5 576 214 s is the 
total time span between the latest and earliest data seg- 
ments. For safety, e.g. against noise fluctuations that 
could shift a candidate peak. A/ has been increased by 
a further 30%, so that the width of the coincidence cell 
in / below 400 Hz is A/ = 1.78 niHz and A/ = 2.9 mHz 
above 400 Hz. 

In the frequency-derivative direction, the size of the 
coincidence cell is given by the largest dfj spacing in the 
parameter space grid, which is also determined by the 
smallest value of Tspan j" • For safety this is also increased 
by 30%, so that A/ = 3.18 x lO^^" Hz s"^ below 400 Hz 
and A/ = 5.19 x 10-^° Hz s"^ above 400 Hz . 

In sky position, the size of the coincidence cells 
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FIG. 2: The parameters Aa(0) and AS{0) of the sky 
coincidence-window model as a function of the 10 Hz fre- 
quency band. The vertical dashed line at 400 Hz indicates 
the separation between the low and high frequency ranges. 



is guided by the behavior of the parameter-space 
metric. As described in |10j . the density of grid 
points in the sky is approximately proportional to 
|cos(i5) sin(5)| c>c |sin(2(5)|, and it follows from [TU' that 
cos((5) da = I sin((5)| dS — const. Because of the singular- 
ity when 5 — *■ 0, a useful model for the coincidence win- 
dow size varying with declination is given by 



Aa{S) = Aa(0)/cos((5) 

;(o) 

i(0)/|sin( 



(1) 



if \S\ < 6c 
KAa(0))| ii\S\>dc 



To ensure continuity at S = Sc, the transition point Sc is 
defined by the condition Aq;(0)/| sin(|(5c| — kAq!(0))| — 
A(5(0). The tuning parameter n is chosen based on vi- 
sual inspection to be n = 1.5 in this search. The values 
of Aa(0) and A(5(0) are directly determined from the sky 
grids (see [TO] for details). Figure [2] shows these parame- 
ters for all sky grids as a function of frequency. As stated 
above, the sky grids are constant for lOHz-wide steps in 
frequency, and so these parameters vary with the same 
step-size. 



C. Output of the post-processing 

The output of the post-processing is a list of the can- 
didates with the greatest number of coincidences. The 
possible number of coincidences ranges from a minimum 
of to a maximum of 28 (the number of data segments 
analyzed) . The meaning of C coincidences is that there 
are C candidate events within a given coincidence cell. 
In each frequency band of coincidence-window width A/, 
the coincidence cell containing the largest number of can- 
didate events is found. The pipeline outputs the average 



frequency of the coincidence cell, the average sky position 
and spin-down of the candidate events, the number of 
candidate events in the coincidence cell, and the "signifi- 
cance" of the candidate. The significance of a candidate, 
first introduced in ^ and explained in [TO], is defined 

by 



'5 = E(-^9-Ml + -^g)), 



(2) 



where J-q is the JF-statistic value of the q'th candidate 
event in the same coincidence cell, which harbors a total 
of C candidate events. 



D. False alarm probability and detection threshold 

The central goal of this search is to make a confident 
detection, not to set upper limits with the broadest pos- 
sible coverage band. This is reflected in the choice of 
detection threshold based on the expected false alarm 
rates. In this search the background level of false alarm 
candidates is expected at 10 coincidences (out of 28 pos- 
sible). As a pragmatic choice, the threshold of confident 
detection is set at 20 coincidences, which is highly im- 
probable to arise from random noise only. These settings 
will be elucidated in the following. 

To calculate the false alarm probabilities, consider the 
case where £scg{k) candidate events per data segment 
obtained from pure Gaussian noise are distributed uni- 
formly about A'coii(fc) independent coincidence cells in a 
given 0.5 Hz band k. Assuming the candidate events are 
independent, the probability PF(fc;Cniax) per coincidence 
cell of finding Cmax or more candidate events from differ- 
ent data segments has been derived in [TO] and is given 
by the binomial distribution 



PF(fc;Cmax) 



E 



SGg 

n 



E(fc)]"[l-e(fc)] 



, (3) 



where e(fc) denotes the probability of populating any 
given coincidence cell with one or more candidate events 
in a given data segment, obtained as 



e(fc) 



A^cell(fc) 



(4) 



Finally, the probability i-F(fc;Cniax) that there are Cmax 
or more coincidences in one or more of the A'coii cells per 
0.5 Hz band k is 



PF(fc;C„iax) = 1 - [1 -?'F(fc;Cmax)] 



(5) 



Figure [3] shows the dependence of i-'F(fc;Cniax) on the 
frequency bands for different values of Cmax- One finds 
that the average false alarm probability of obtaining 10 
or more coincidences is approximately 10"'^. This means, 
in our analysis of 2 900 half-Hz frequency bands, only a 
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FIG. 3: False alarm probabilities PF(A:;Cmax) as a func- 
tion of frequency band (labeled by k) for different values of 
Cmax G {10, 14, 17, 20, 25}. The dashed horizontal hues rep- 
resent the corresponding average across all frequencies. The 
vertical dashed line at 400 Hz indicates the separation between 
the low and high frequency ranges. 



FIG. 4: Estimated sensitivity of the EinsteinQHome search 
for isolated periodic GW sources in the early LIGO S5 data. 
The set of three curves shows the source strain amplitudes 
ho at which 10% (bottom), 50% (middle) and 90% (top) of 
simulated sources would be confidently detected in this Ein- 
steinQHome search. 



few candidates are expected to have 10 or more coin- 
cidences. Thus this will be the anticipated background 
level of coincidences, because from pure random noise 
one would not expect candidates of more than 10 coin- 
cidences in this analysis. In contrast, the false alarm 
probability of reaching the detection threshold of 20 or 
more coincidences per 0.5 Hz averaged over all frequency 
bands is about 10~^^. Therefore, this choice of detection 
threshold makes it extremely improbable to be exceeded 
in case of random noise. 



V. ESTIMATED SENSITIVITY 

The methods used here would be expected to yield 
very high confidence if a strong signal were present. To 
estimate the sensitivity of this detection scheme, Monte- 
Carlo methods are used to simulate a population of 
sources. The goal is to find the strain amplitude Hq at 
which 10%, 50%, or 90% of sources uniformly populated 
over the sky and in their "nuisance parameters" would 
be confidently detected. In this analysis, "detectable" 
means "produces coincident events in 20 or more dis- 
tinct data segments" . As discussed above, the false alarm 
probability for obtaining such a candidate in a given 
0.5 Hz band is of order 10~^^. This is therefore an esti- 
mate of the signal strength required for high-confidence 
detection. For this purpose, the pipeline developed in |10l 
is run here, using the input data of the present analy- 
sis. A large number of distinct simulated sources (tri- 
als) are tested for detection. A "trial" denotes a single 
simulated source which is probed for detection. For a 
detailed description of the methodology, the reader is re- 



ferred to [TO] . 

Figure |4] shows the resulting search sensitivity curves 
as functions of frequency. Each data point on the plot de- 
notes the results of 1 000 independent trials. These show 
the values of Hq as defined in [TT] such that 10%, 50%, 
and 90% of simulated sources are confidently detected in 
the post-processing pipeline. 

The dominant sources of error in these sensitivity 
curves are uncertainties in calibration of the LIGO de- 
tector response functions (cf. [101 [H]). The uncertainties 
range typically from about 8% to 15%, depending on fre- 
quency. 

The behavior of the curves shown in Fig.H essentially 
reflects the instrument noise given in Fig. [H^ One may 
fit the curves obtained in Fig. [4] to the shape of the 
harmonic- mean averaged strain noise power spectral den- 
sity 5/i(/). Then the three sensitivity curves in Fig.|4]are 
described by 



^oif)^Rv\[^, (6) 



where the pre-factors R-d for different detection proba- 
bilities levels V = 90%, 50%, and 10% are well fit below 
400 Hz by i?90% = 29.4, i?5o% = 18.5, and i?io% = 11.6, 
and above 400 Hz by i?9o% = 30.3, -R5o% — 19.0, and 
-^10% = 11-8- 
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VI. RESULTS 
A. Vetoing instrumental noise lines 

At the time the instrument data was prepared and 
cleaned, narrow-band instrumental line features of known 
origin were removed, as previously described in Sec. [TTl 
However, the data also contained stationary instrumen- 
tal line features that were not understood, or were poorly 
understood, and thus were not removed a ■priori. After 
the search had been conducted, at the time the post- 
processing started, the origin of more stationary noise 
lines became known. Therefore, these lines, whose origin 
was tracked down after the search, are excluded (cleaned 
a posteriori) from the results. A list of the polluted fre- 
quency bands which have been cleaned a posteriori is 
shown in Tab. |IV| in the Appendix. 

However, noise features still not understood instru- 
ment ally at this point were not removed from the results. 
As a consequence, the output from the post-processing 
pipeline contains instrumental artifacts that in some re- 
spects mimic periodic GW signals. But these artifacts 
tend to cluster in certain regions of parameter space, and 
in many cases they can be automatically identified and 
vetoed as done in previous searches |10lE5] . The method 
used here is derived in [21] and a detailed description of 
its application is found in |10j . 

For a coherent observation time baseline of 30 h the 
parameter-space regions where instrumental lines tend 
to appear are determined by global-correlation hypersur- 
faces of the J^-statistic. On physical grounds, in these 
parameter-space regions there is little or no frequency 
Doppler modulation from the Earth's motion, which can 
lead to a relatively stationary detected frequency. Thus, 
the locations of instrumental-noise candidate events are 
described by 



/ + / 



(7) 



where c denotes the speed of light, n is a unit vector 
pointing to the source's sky-location in the SSB frame 
and relates to the equatorial coordinates a and 5 by 
n = (cos 5 cos a, cos 5 sin a, sin (5), Vj is the orbital ve- 
locity of the Earth at the midpoint of the j'th data 



segment ( | Vj 



10 c). The parameter e accounts 



for a certain tolerance needed due to the parameter- 
space gridding and can be understood as e = Af /N^ AT, 
where A/ denotes width in frequency (corresponding to 
the coincidence-cell width in the post-processing) up to 
which candidate events can be resolved during the char- 
acteristic length of time AT, and Nc represents the size 
of the vetoed or rejected region, measured in coincidence 
cells. In this analysis AT — 5 718 724 s (« 66 days) is the 
total time interval spanned by the input data. 

Because false alarms are expected at the level of 10 co- 
incidences, candidates that satisfy Eq. (|7| for more than 
10 data segments are eliminated (vetoed)?^ The fraction of 
parameter space excluded by this veto is determined by 



Monte-Carlo simulations to be about 13%. From Eq. ([t]) 
it follows that for fixed frequency the resulting fraction of 
sky excluded by the veto (uniformly averaged over spin- 
down) is greatest at lowest frequencies and decreases ap- 
proximately as for higher frequencies. Appendix A 
of Ref. [To] presents an example calculation, illustrat- 
ing the parameter-space volume excluded by this vetoing 
method. 



B. Hardware-injected signals 

During parts of the LIGO S5 run ten simulated peri- 
odic GW signals were injected at the hardware level, by 
modulating the interferometer mirror positions via sig- 
nals sent to voice actuation coils surrounding magnets 
glued near the mirror edges. The hardware injections 
were active only part of the time during the S5 science 
run; in only 12 (of the 28) data segments chosen for 
this search did the hardware injections have duty cycles 
greater than 90%. But the value of 12 coincidences is far 
below the detection condition. Therefore, the hardware 
injections are not expected to be detected in this search, 
simply because they were inactive during a large fraction 
of the data analyzed. 



C. Post-processing results 

Figures |5] and [6] summarize all post-processing re- 
sults from the entire search frequency range of 50 Hz to 
1500 Hz, for each frequency coincidence cell maximized 
over the entire sky and full spin-down range. 

In Fig. 5(a) all candidates that have 7 or more coinci- 



dences are shown in a sky projection. The color scale is 
used to indicate the number of coincidences. The most 
prominent feature still apparent forms an annulus of high 
coincidences in the sky, including the ecliptic poles, a dis- 
tinctive fingerprint of the instrumental noise lines [24]. 
To obtain the results shown in Fig. |5(b)| the set of can- 
didates is cleaned a posteriori by removing strong in- 
strumental noise lines, whose origin became understood 
after the search was begun, and excluding the hardware 
injections. Finally, in Fig. 5(c) the parameter-space veto 



is applied and coincidence cells which contain candidate 
events from a single detector only are excluded, too. 



In Fig. 6(a) the coincidences and significance of all can- 



didates that have 7 or more coincidences are shown as a 
function of frequency. From this set of candidates the 
hardware injections are excluded, strong instrumental 
noise lines of known origin are removed, the parameter- 
space veto is applied and finally single-detector candi- 
dates are excluded to obtain Fig. 6(b)| 



As can be seen from Figs. 5(c) 



and |6(b)| there are no 
candidates that exceed the predefined detection thresh- 
old of 20 coincidences (which would initiate more a exten- 
sive investigation). The largest number of coincidences 
found is 10, which is at the background level of false 
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FIG. 5: Sky maps of post-processing results. Candidates 
having more than 7 coincidences are shown in Hammer- AitofF 
projections of the sky. The color-bar indicates the number of 
coincidences of a particular candidate (cell). The top plot (a) 
shows the coincidence analysis results. In (b) a posteriori 
strong lines of known instrumental origi n an d hardware in- 
jections are removed. The bottom plot (c) is obtained by 
additionally applying the parameter-space veto and exclud- 
ing single-detector candidates. 
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The top plot (a) shows the post-processing can- 
didates having more than 7 coincidences as function of fre- 
quency. The light-gray shaded rectangular regions highlight 
the frequency bands of the hardware injections. The dark- 
gray data points show the candidates resulting from the 
hardware-injected GW signals. In (b) the final results are 
shown after exclusion of instrumental lines of known origin 
and hardware injections, application of parameter-space veto 
and exclusion of single-detector candidates. 



VII. COMPARISON WITH PREVIOUS 
SEARCHES 



alarms expected from random noise only. From these 
candidates having 10 coincidences, Table [II] lists the ten 
most significant ones. 



A previous paper [TU] reported on the results of the 
Einstein@Home search for periodic GW signals in the 
LIGO S4 data. The present work extends this search 
analyzing more sensitive LIGO S5 data while using the 
same methods described in [TO]. Therefore, this section 
elucidates the changes in configuration of the search and 
post-processing. 

First, not only is more sensitive data used here, but a 
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TABLE II: The ten most significant post-processing candidates that have 10 or more coincidences. The frequency of each 
candidate /cand refers to the fiducial GPS time teduciai = 816397490s. The parameters 5cand, Qcand, /cand, Ccand = CfiLd+Ccstnd 
and iScand are for the most-significant, most-coincident candidate with the given frequency of /cand, where C^J^d ^-nd C^^^d denote 
the number of coincidences from detectors HI and LI, respectively. 
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larger total volume of data is searched compared to [10] . 
The number of 30-h data segments analyzed increased 
from 17 to 28. 

In addition, the template grids used in each data seg- 
ment of this search were constructed to be denser, reduc- 
ing the possible loss of signals due to mismatch in the 
template waveforms. Compared to the previous search 
in S4 data, where a maximal mismatch of m — 0.2 
(m = 0.5) was used in the low (high) frequency range, 
here templates are placed on a grid of higher density us- 
ing m = 0.15 (m = 0.4) in the low (high) frequency 
range. 

Moreover, in the high-frequency range a larger range 
of possible spin-downs is searched. The S4 analysis 
searched over minimum spin-down ages greater than 
10 000 yr for frequencies in the higher range (/ > 300 Hz) , 
whereas this analysis searches over minimum spin-down 
ages greater than SOOOyr for frequencies in the higher 
range (/ > 400 Hz). The different partitioning of fre- 
quencies into the low and high ranges (split at 300 Hz in 
S4, split at 400 Hz here) is a consequence of an optimiza- 
tion study reflecting the overall most sensitive search at 
given computing power. 

This search presented here analyzed in total about 
three times more workunits than in the S4 search. In 
searching the S4 data, each workunit returned the top 
13 000 candidate events, whereas this search is designed 
to keep only the top 1000 (10 000) candidate events in 
the low (high) frequency range. This configuration has 
the purpose of balancing the load on the Einstein@Home 
servers, which receive the workunit results. A low- 
frequency workunit returns a factor of 10 fewer events, 
because these were designed to last approximately 10 
times less than each high-frequency workunit. 

Finally, based on the estimates presented in Sec. |V] 
the present search is overall about a factor of three more 
sensitive than the previous S4 search. This improvement 
is a consequence of using more sensitive detector data in 
combination with a finer-spaced template bank. 

The methods used here, as well as in the S4 paper, 
would be expected to give very high confidence if a strong 
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FIG. 7: Comparison of search parameter spaces in the plane 
of frequency and frequency derivative. The dark-gray region 
refers to this Einstein@Home all-sky analysis in early LIGO 
S5 data. The light-grey area corresponds to the recent all-sky 
PowerFlux search [T^ in early LIGO S5 data. 



enough signal were present in the data. It is interesting 
to compare the sensitivity of this detection scheme with 
the sensitivity of upper limits such as presented recently 
in [T5]. Based on the PowerFlux method [23], that anal- 
ysis set strain upper limits at the 95% confidence level in 
the frequency range of 50 - 1100 Hz and the frequency- 
derivative range of— 5 x 10"^ — OHzs"^ using 7 147h of 
early LIGO S5 data, about 8.5 times more data than was 
used here. Note that this Einstein@Home search explores 
substantially larger parts of parameter space in frequency 
and frequency derivative, as shown in Fig. [7] 

The upper-limit worst-case results of [TS] for the 
equatorial sky region are remarkably close to the 90%- 
detection-level /iq- values of Fig. |4] However, these Pow- 
erFlux upper limits refer to the most unfavorable polar- 
ization and sky position. A population-based upper limit 
over all sky locations and polarizations would be lower. 



11 



On the other hand, another key difference between the 
PowerFlux upper hmits procedure and the sensitivity es- 
timation carried out here is the detection criteria. In the 
present work;, detection requires a signal to generate 20 
or more coincidences among the 28 different data seg- 
ments. This corresponds to a false alarm probability in 
Gaussian noise of the order 10"^"'^ per 0.5 Hz frequency 
band. This is different from |15j . where simulated signals 
are compared to the strongest candidates found. Thus, 
an equivalent detection criterion for this work would be 
to compare the signals against the strongest candidates 
in each 0.5 Hz band. These are typically 10 coincidences, 
which relates to a Gaussian noise false alarm rate of or- 
der 10^''. One can estimate the effect on sensitivity by 
recomputing the sensitivity estimation of Sec. [V] but re- 
quiring each signal to produce only 10 coincidences. This 
reduces the prefactors Rt> given above by a factor of 1.24. 

Apart from the larger parameter space searched, the 
present analysis is achieving roughly comparable sensitiv- 
ity to [T5] in spite of searching 8.5 times less data. Much 
of this effectiveness is due to the increased coherent in- 
tegration time (30 hours vs 30 minutes), which is only 
possible due to the great amount of computing power 
donated by the tens of thousands of Einstein@Home vol- 
unteers. 



VIII. CONCLUSION 

Using early LIGO fifth-science-run data this paper re- 
ports on the results from the Einstein@Home search for 
unknown periodic GW sources, extending the previous 
Einstein@Home search in LIGO S4 data [lOj. The sensi- 
tivity of the present analysis improves upon the previous 
Einstein@Home S4 search by a factor of about three. Ad- 
ditionally, in large regions of the parameter space probed, 
this analysis yields the currently most sensitive all-sky 
search results for periodic GW sources. 

No credible periodic GW signal was found. Over 
a 100-Hz wide band around the detectors' most sensi- 
tive frequencies, more than 90% of sources with dimen- 
sionless gravitational-wave strain amplitude greater than 
3 X 10~^^ would have been detected. 

While no statistically significant signal was observed 
in this analysis, the results demonstrate the capability of 
public distributed computing to accomplish a sensitive 
periodic GW search for the benefit of future searches. 

The sensitivity of the present analysis is essentially lim- 
ited by the first-stage threshold on JF-statistics forced 
by the limited data volume which can be returned from 
the participating clients. A new Einstein@Home search 



currently underway carries out the incoherent combina- 
tion of J^-statistic results on the client machines (done 
here in the post-processing once results were sent back). 
This makes it possible to set a much lower (sensitivity- 
optimized) first-stage threshold on JF-statistics. Hence, 
results from the new search promise a significant en- 
hancement in the overall sensitivity for a periodic GW 
detection. 
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APPENDIX A: CLEANED INSTRUMENTAL 
NOISE LINES 

Table |III| lists the frequencies of noise lines excluded 
from the data and replaced by Gaussian noise a pri- 
ori to the search. Table |IV| lists the central frequen- 
cies around either side of which the Doppler band 
(A/Li,ic = /Line X lO^"') is o posteriori excluded from the 
post-processed search results. 



[1] A. Abramovici et al, Science 256, 325 (1992). chio, and N. Andersson, Phys. Rev. D 58, 084020 (1998). 

[2] B. C. Barish and R. Weiss, Physics Today 52, 44 (1999). [5] G. Ushomirsky, C. Cutler, and L. Bildsten, Mon. Not. 
[3] L. Bildsten, Ap. J. 501, L89 (1998). Roy. Astron. Soc. 319, 902 (2000). 

[4] B. J. Owen, L. Lindblom, C. Cutler, B. F. Schutz, A. Vec- [6] C. Cutler, Phys. Rev. D 66, 084025 (2002). 



12 



TABLE III: Instrumental noise lines cleaned from HI and LI data. The three columns show the central frequency /Line, the 
bandwidth A/^j^^ removed below the central frequency and the bandwidth A/^fjl^ removed above the central frequency. Thus 

the total bandwidth removed per central frequency is A/^i^j, + A/l^^o- In addition, at each harmonic of the 60 Hz mains 
frequency, the same bandwidth is also removed. A zero bandwidth indicates that the line-cleaning algorithm replaces in these 
cases a single Fourier bin with the average of bins on either side. The spacing between Fourier bins is 1/1800 Hz. 
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TABLE IV: Frequencies of instrumental lines that have been 
excluded a posteriori from the post-processed search results. 
Each column shows the central frequency /Line around which 
a bandwidth of A/Linc = func x 10~* has been removed on 
either side. The cleaned bandwidth corresponds to the max- 
imum possible frequency shift due to the global parameter- 
space correlations ,24. . On physical grounds this is related to 
the maximum possible Doppler shift due to the orbital veloc- 
ity of the Earth, which is approximately 10"* in units of the 
speed of light. 
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